Picture for Ilia Sucholutsky

Ilia Sucholutsky

Michael Pokorny

Selective QA over Conflicting Multi-Source Personal Memory: A Diagnostic Testbed and Method Comparison

Add code
May 28, 2026
Viaarxiv icon

On-Policy Consistency Training Improves LLM Safety with Minimal Capability Degradation

Add code
May 20, 2026
Viaarxiv icon

Medical Model Synthesis Architectures: A Case Study

Add code
May 10, 2026
Viaarxiv icon

Improving the Efficiency of Language Agent Teams with Adaptive Task Graphs

Add code
May 07, 2026
Viaarxiv icon

Failing to Falsify: Evaluating and Mitigating Confirmation Bias in Language Models

Add code
Apr 02, 2026
Viaarxiv icon

Do Large Language Models Mentalize When They Teach?

Add code
Apr 02, 2026
Viaarxiv icon

Under the Influence: Quantifying Persuasion and Vigilance in Large Language Models

Add code
Feb 26, 2026
Viaarxiv icon

Why Human Guidance Matters in Collaborative Vibe Coding

Add code
Feb 11, 2026
Viaarxiv icon

Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships

Add code
May 22, 2025
Viaarxiv icon

Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis

Add code
Mar 17, 2025
Viaarxiv icon